Randomized rule selection in transformation-based learning: a comparative study

نویسندگان

  • Sandra Carberry
  • K. Vijay-Shanker
  • Andrew Wilson
  • Ken Samuel
چکیده

Transformation-Based Learning (TBL) is a relatively new machine learning method that has achieved notable success on language problems. This paper presents a variant of TBL, called Randomized TBL, that overcomes the training time problems of standard TBL without sacri cing accuracy. It includes a set of experiments on part-of-speech tagging in which the size of the corpus and template set are varied. The results show that Randomized TBL can address problems that are intractable in terms of training time for standard TBL. In addition, for language problems such as dialogue act tagging where the most e ective features have not been identi ed through linguistic studies, Randomized TBL allows the researcher to experiment with a large set of templates capturing many potentially useful features and feature interactions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NEW CRITERIA FOR RULE SELECTION IN FUZZY LEARNING CLASSIFIER SYSTEMS

Designing an effective criterion for selecting the best rule is a major problem in theprocess of implementing Fuzzy Learning Classifier (FLC) systems. Conventionally confidenceand support or combined measures of these are used as criteria for fuzzy rule evaluation. In thispaper new entities namely precision and recall from the field of Information Retrieval (IR)systems is adapted as alternative...

متن کامل

Comparative Analysis of Machine Learning Algorithms with Optimization Purposes

The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches‎. ‎Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data‎. ‎In this paper‎, ‎a methodology has been employed to opt...

متن کامل

On New Methods of Dynamic Ensemble Selection Based on Randomized Reference Classifier

In the paper two dynamic ensemble selection (DES) systems are proposed. Both systems are based on a probabilistic model and utilize the concept of Randomized Reference Classifier (RRC) to determine the competence function of base classifiers. In the first system in the selection procedure of base classifiers the dynamic threshold of competence is applied. In the second DES system, selected clas...

متن کامل

Comparative Impact of In-Presence and Distance Learning Methods on Knowledge Improvement in the Healthcare Workers

Background and Objectives: In continuous medical learning programs, selection of appropriate learning methods is important for streaming individuals by learning process. The goal of this study is to compare in-presence learning method with distance learning method and explore their effect on the knowledge of health workers (practical nurses) in terms of reduced expenses.   Methods: The study ...

متن کامل

A Comparative Study of Various Data Transformation Techniques in Data Mining

This research paper presents a technique to select an ideal transformation technique of original and transformed features. The paper reviews about a comparative study of various data transformation techniques used in data mining which includes six types of transformation techniques Wavelets, Genetic Algorithm and Wrappers, Identity transform, Program synthesis, Data refinement transformation, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Natural Language Engineering

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2001